Proper Names in Dialogs from the Warsaw Transportation Call Center
Abstract
In this paper we present a method for the automatic recognition and annotation of proper names that occur in dialogs gathered at the Warsaw city transportation information center. We describe the different types of proper names and how people use them in dialogs. We also present rules for the automatic recognition and lemmatization of proper names in the transportation domain.
Related resources
Design and Data Collection for Spoken Polish Dialogs Database
Spoken corpora provide a critical resource for research, development and evaluation of spoken dialog systems. This paper describes the telephone spoken dialog corpus for Polish created by the Polish-Japanese Institute of Information Technology team within the LUNA project (IST 033549). The main goal of this project is to create a robust natural spoken language understanding (SLU) toolkit, which can...
Syntactic Structure of Polish Proper Names of Places
The aim of this presentation is to introduce a syntactic analysis of Polish proper names of places. This study focuses on names of places in Warsaw, mainly places that have a postal address. We will introduce briefly the domain of our interest and give statistics of the collected data. Then, we will review types of syntactic structures that have the highest frequency in our corpus. The goal of ...
Indefinite Proper Names and Quantification over Manifestations
Starting from the observation that in German proper names can be determined by the indefinite article without apparently shedding their original referent, this paper presents a range of “nonstandard” uses of proper names, among them one we will call the “manifestation use”. It will be suggested that certain occurrences of proper names cannot be adequately analysed without assuming manifestation...
Emotion Detection in Task-oriented Spoken Dialogs
Detecting emotions in the context of automated call center services can be helpful for following the evolution of the human-computer dialogs, enabling dynamic modification of the dialog strategies and influencing the final outcome. The emotion detection work reported here is part of a larger study aiming to model user behavior in real interactions. We make use of a corpus of real agent-client s...
Toposław - A Lexicographic Framework for Multi-word Units
The paper presents a tool for the creation of an electronic dictionary of multi-word proper names. Toposław uses graphs for the representation of inflectional and pragmatic variants of names. It cooperates with Morfeusz, a morphological analyser and generator for Polish words, and Multiflex, a cross-language morpho-syntactic generator of multi-word units. Our goal was to create a user-friendly t...